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COMENTS  On  H.  J.  BARNETT'S 
"SPECIFIC  INDUSTRY  OUTPUT  PROJECTIONS" 

A.  W.  Marshall 


Two  comments  come  to  mind  with  regard  to  Dr.  Barnett’s  paper,  "Specific 
Industry  Output  Projections."  The  firet  of  these  concerns  the  appropriate¬ 
ness  of  comparisons  of  the  kind  carried  out  in  the  paper  for  reaching  de¬ 
cisions  ae  to  the  acceptance  or  rejection  of  proposed  forecasting  methods; 
the  second  comment  concerns  an  alternative  method  of  measuring  the 
forecasting  errors. 

It  would  appear,  given  the  comparisons  in  Tables  3  and  4,  based  upon 
the  mean  deviations  of  actual  from  predicted  industry  outputs,  that  pro*- 
Jection  methods  using  input-output  tables  are  not  very  much  better  than 
quite  elementary  "naive"  model  methods.  Indeed  the  multiple  regression 
forecasts  seem  to  be  somewhat  better  than  those  based  upon  input-output 
relations.  Where  "naive"  model 5  have  in  some  sense  to  be  taken  seriously; 
e.g.,  if  asked  to  forecast  output  by  industry,  for  1956  say,  I  would  prefer 
to  use  Barnett’s  multiple  regression  method  rather  than  the  input-output 
method  exhibited  here;  it  is  weiu  \o  keep  in  mind  their  purpose  and  charac¬ 
ter.  They  are  not  intended  to  be  legitimate  alternatives  to  the  model  or 
procedure  being  tested,  but  rather  are  designedly  crude  and  inefficient, 
almost  reductlo  ad  absurd um  constructions  of  economic  models  and  forecasting 
procedures.  They  represent  a  level  of  efficiency  so  low  and  so  easily 
attained  that  any  forecasting  procedure  seriously  proposed  for  operational 
use,  which  cannot  almost  uniformly  do  better  than  they  can,  must  be  re¬ 
jected  ae  being  operationally  unacceptable.  Two  warnings  are  needed  here. 
First,  Barnett’s  multiple  regreseion  model  must  be  conceded  to  be 
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"semi-naive"  in  the  sense  that  even  if  we  were  to  add  additional  variables 
to  the  equations  which  were  thought  to  have  a  special  relevance  for  the 
explanation  of  the  output  of  some  specific  industry,  it  is  unlikely,  due 
to  the  correlation  between  most  econoadc  time  series,  that  continued  large 
reductions  in  the  sum  of  squares  about  the  regression  line  can  be  obtained. 
In  other  words  the  substitution  and/or  addition  of  other  explanatory  varia¬ 
bles  will  probably  not  greatly  increase  the  forecasting  efficiency  of  these 
regression  equations;  we  are  already,  very  likely,  in  an  area  of  diminishing 
returns  to  scale  in  this  sense*  Second,  ^fcbe  kind*  of  ooRpftriMnm  made  in 
Barnett’s  paper  are  very  appropriate  to  decisions  as  to  whether  or  not  a 
certain  method  of  forecasting  should  or  should  not  be  used  in  practice, 
given  its  current  stage  of  development,  but  are  often  of  minor  importance 
with  regard  to  decisions  concerning  the  advisability  of  continuing  develop¬ 
ment  of  these  methods.  Thus  this  type  of  competitive  trial  of  serious, 
though  perhaps  immature,  models  and  forecasting  methods  against  "naive" 
models  should  not  lead  anyone  to  discard,  or  neglect  the  development  of, 
really  promising  techniques. 

As  a  second  comment  I  should  like  to  suggest  an  alternative  and  some¬ 
what  more  natural  measure,  at  least  to  a  mathematician,  of  the  error  of 
prediction  of  the  various  methods  of  projecting  or  estimating  specific 
industry  outputs  in  some  future  year.  In  Tables  3  and  4  Barnett  has  used 


as  his  measure  of  error 


|  Xi(GNP*)  -  Ii(A)  |  , 


where  X  (0*tP*) 


denotes  the  estimated  output,  in  terms  of  either  an  index  number  or  dollar 
value,  of  the  ith  industry  based  upon  the  estimate  GNP*  of  GNP^  and  X^(A) 
denotes  the  actual  output  of  the  ith  industry.  All  of  the  above,  of  course, 
refers  to  some  fixed  year  and  method  of  forecasting.  As  an  alternative  it 
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is  appealing  to  think  of  the  observed  production  by  industry  and  the  projected 
productions  as  vectors  in  n  dimensional  Euclidean  space  and  to  think  of  the 
error  of  the  projection  as  being  the  distance  between  the  two  points.  Each 
vector  has  28  components  in  Barnett’s  case  and  the  distance  between  the  two 
points  (vectors)  is  given  by 


Not  only  is  this  the  more  usual  definition  of  the  distance  between  two 
vectors  but  it  is  also  a  measure  which  fits  in  with  what  would  seem  to  be, 
from  the  statistical  point  of  view,  the  aim  of  research  in  forecasting 
methods,  i.i.,  the  finding  of  minimum  variance  estimates  of  the  future 
values  of  economic  variables.  Prom  this  point  of  view  we  also  see  that 
once  it  is  decided  what  it  is  we  wish  to  forecast,  all  questions  of  further 
disaggregation  resolve  themselves  into  questions  as  to  whether  or  not  a 
particular  disaggregation  reduces  the  variance  of  our  forecasts. 

One  additional  comment  is  to  be  made.  Since  it  seems  to  be  almost 
certain  that  in  the  future  we  will  have  estimates  of  GNP  which  have  con¬ 
siderably  smaller  errors  than  the  estimates  used  in  the  present  paper, 
whose  accuracy  was  compromised  by  secondary  consumption  or  investment  con¬ 
siderations,  some  separation  of  the  total  error  of  the  various  forecasting 
techniques  into  its  component  parts  is  desirable.  Errors  of  the  order  of 
magnitude  made  in  the  GNP  estimates  are  so  large  that  in  some  sense  none 
of  the  methods  obtained  a  fair  trial.  It  is,  of  course,  in  general  desir¬ 
able  to  be  able  to  factor  out  the  errors  contributed  by  the  separate  steps 
in  the  forecasting  methods  since  one  method  may  be  much  more  sensitive  than 
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another  to  errors  in  some  common  component,  say  the  first  component  in  all 
of  the  forecasting  methods  in  Barnett* e  paper,  the  estimate  of  GNP.  This 
factorization  ie  eaeily  carriea  out  in  principle  ae  may  be  seen  ae  follove: 
Let  us  denote  by  l^(GNP)  the  estimated  output  of  the  ith  induetry  we  would 
have  made  if  we  had  known  the  true  value  of  GNP;  then  we  have 


ae  a  measure  of  the  error  we  would  have  made  even  if  we  had  had  the  best 
possible  knowledge  of  the  value  of  GNP.  Aleo,  we  have 

/  28  2\i 

I  (  ^(QJP*)  -  X^GNP)^  j  , 

and  thus  the  total  error  d  is  separated  into  two  factore  of  which  it  is 
the  vector  reeultant.  As  an  example  I  have  performed  this  factorization 
for  Barnett’s  multiple  regreseion  method  where  one  can  easily  obtain  the 
X^(GNP)  from  the  aquations 

Xi(GNP)  ■  ai  t  b^  GNP  *  Cjt 

by  eubstltuting  the  correct  value  of  GNP  (152, 400)  rather  than  the  estimate 
GNP*  used  to  obtain  the  values  in  Tables  3  and  4.  A  similar  factorization 
for  the  input-output  method  would,  of  course,  entail  much  more  work.  Work¬ 
ing  with  the  consumption  model  (GNP*  a  170,000)  and  the  dollar  value  figures 
in  millions  of  1939  dollars  we  obtain  the  resulte  listed  in  Table  1,  below, 
along  with  the  comparable  figure  of  d  for  the  input-output  method. 


f 
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TABLE  I 

COMPONENTS  OF  ERROR 


Multiple  Regression 

1  d  (total  errors) 

5,325 

p*  * 

JA78  — 

(error  given 

3,354 

d2  (GNP*  component 
of  error) 

6,052 

Thus  using  an  estimate  of  GNP  which  is  too  large  by  11.55f  leads  to  an 


overall  increase  in  the  error  of  forecast  of  59.0JC.  The  reader  will  also 
notice  that  the  distance  between  the  two  estimates,  one  based  upon  GNP  and 
the  other  upon  GNP*  are  farther  from  one  another  than  each  estimate  is 
from  the  true  value.  If  we  were  concerned  with  a  two-industry  world  this 
situation  would  be  illustrated  as  in  Chart  1. 


CHART  1 
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